Analysis of the cDNAs of hypothetical genes on Arabidopsis chromosome 2 reveals numerous transcript variants.
Abstract
In the fully sequenced Arabidopsis (Arabidopsis thaliana) genome, many gene models are annotated as "hypothetical protein"; their gene structures are predicted solely by computer algorithms, with no support from expressed sequence matches from Arabidopsis or from nucleic acid or protein homologs from other species. To confirm their existence and predicted gene structures, a high-throughput method of rapid amplification of cDNA ends (RACE) was used to obtain their cDNA sequences from 11 cDNA populations. Primers were designed for all 797 hypothetical genes on chromosome 2, and, through 5' and 3' RACE, clones from 506 genes were sequenced and cDNA sequences from 399 target genes were recovered. The cDNA sequences were obtained by assembling the 5' and 3' RACE polymerase chain reaction products. These sequences revealed that (1) the structures of 151 hypothetical genes differed from their predictions; (2) 116 hypothetical genes had alternatively spliced transcripts and 187 genes displayed alternative polyadenylation sites; and (3) there were transcripts arising from both strands, transcripts arising from the strand opposite to that of the prediction, and possible dicistronic transcripts. Promoters from five randomly chosen hypothetical genes (At2g02540, At2g31270, At2g33640, At2g35550, and At2g36340) were cloned into reporter constructs, and their expression was tissue specific or developmental stage specific. Our results indicate that at least 50% of the hypothetical genes on chromosome 2 are expressed in the cDNA populations, with about 38% of the gene structures differing from their predictions. Thus, by using this targeted approach, high-throughput RACE, we revealed numerous transcripts, including many uncharacterized variants, from these hypothetical genes.
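The two percentages quoted in the abstract can be checked against its own counts. The sketch below assumes the denominators, which the abstract does not state explicitly: all 797 hypothetical genes for the expressed fraction, and the 399 genes with recovered cDNA for the fraction whose structures differ from prediction. The variable and function names are illustrative only.

```python
# Counts taken from the abstract.
TOTAL_HYPOTHETICAL_GENES = 797  # hypothetical genes on chromosome 2
GENES_WITH_CDNA = 399           # target genes with recovered cDNA sequences
GENES_DIFFERING = 151           # genes whose structure differs from prediction


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage."""
    return 100.0 * part / whole


# Assumed denominator: all hypothetical genes on chromosome 2.
expressed_pct = percent(GENES_WITH_CDNA, TOTAL_HYPOTHETICAL_GENES)

# Assumed denominator: genes for which a cDNA sequence was recovered.
differing_pct = percent(GENES_DIFFERING, GENES_WITH_CDNA)

print(f"Expressed: {expressed_pct:.1f}% of hypothetical genes")
print(f"Differing from prediction: {differing_pct:.1f}% of recovered genes")
```

Under these assumptions the results come to roughly 50.1% and 37.8%, consistent with the "at least 50%" and "about 38%" stated in the abstract.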
Similar resources
Identification and Characterization of LHCB1 Co-Suppressed Line in Arabidopsis
To explore the function of light-harvesting complex protein (LHCP) in Arabidopsis growth and development, the Leclere and Bartel seed collection was screened. In this collection, randomly cloned cDNAs are expressed under the CaMV35S promoter. A pale green line has been identified and characterized in more detail. Analysis of the inserted cDNA in the pale green line showed it encodes LHCB1 prote...
Cloning and Expression Analysis of Two Photosynthetic Genes, PSI-H and LHCB1, Under Trehalose Feeding Conditions in Arabidopsis Seedlings
Trehalose (α-D-glucosyl-[1,1]-α-D-glucopyranoside) is involved in mechanisms that coordinate metabolism with plant growth adaptation and development. The main objective of the current work was to find out whether trehalose feeding affects the expression of two genes involved in photosynthesis: one gene coding for photosystem I subunit H (PSI-H) and the other for the light harvesting complex B1 (...
Cloning and sequencing of cDNAs for hypothetical genes from chromosome 2 of Arabidopsis.
About 25% of the genes in the fully sequenced and annotated Arabidopsis genome have structures that are predicted solely by computer algorithms with no support from either nucleic acid or protein homologs from other species or expressed sequence matches from Arabidopsis. These are referred to as "hypothetical genes." On chromosome 2, sequenced by The Institute for Genomic Research, there are ap...
SNHG6 203 Transcript Could be Applied as an Auxiliary Factor for more Precise Staging of Breast Cancer
Background: Long non-coding RNAs are now recognized as an interesting functional part of the transcriptome. LncRNA SNHG6 was reported to be expressed more highly in breast cancer tissues than in non-tumor ones. Because breast cancer is a frequent cancer among women, its treatment needs applicable biomarkers for fast prognosis and diagnosis. SNHG6 RNA and its splice variants could be considered as molecula...
Isolation of Brassica napus MYC2 gene and analysis of its expression in response to water deficit stress
Manipulation of stress-related transcription factors to improve plant stress tolerance is a major goal of current biotechnology research. The MYC2 gene encodes a key stress-related transcription factor involved in the jasmonate (JA) and abscisic acid (ABA) signaling pathways in Arabidopsis. Brassica napus, a globally important oilseed crop, is a close relative of Arabidopsis. In the present study...
Journal title:
- Plant Physiology
Volume 139, Issue 3
Pages -
Publication date 2005